
AGGCTGTCCCACCCACCATCTGCACCCGCTGCAGCGCCCGCGCCCCTGTCCCGCGCCGTAGTCGTCATTTGTAGCCC 
GCCTGCCGCTCCCGGGGACCCGATCCTACCCTGGGTGCGGGGCAGAGCGGGCATGGCCCGTCTGGGGACCGCCTGCC 
CTGCGCTGGCGCTGGCCCTGGCACTTGTGGCGGTGGCCCTGGCTGGAGTCAGAGCCCAGGGCGCAGCCTTCGAGGAG 
CCTGACTATTACAGCCAGGAGCTCTGGCGGCGCGGGCGCTATTATGGGCATCCGGAGCCTGAGCCGGAGCCGGAGCT 
CTTCTCGCCTTCAATGCATGAAGACCTTAGGGTGGAGGAGCAGGAACAGCAGGAGCCGCACCAGCAGGGCCACAGAA 
CTCCCAAGAAGGCCATCAAGCCCAAGAAGGCTCCCAAGAGGGAGAAGTTAGTTGCAGAGACGCCTCCACCAGGTAAA 
AATAGCAACAGAAAAGGCAGAAGAAGCAAGAATCTTGAGAAAGCTGCCAGTGATGACCATGGTGTCCCTGTGGCTCA 
TGAGGATGTCAGAGAGAGTTGCCCACCTCTTGGTCTGGAAACATTAAAAATCACAGACTTCCAGCTGCATGCCTCCA 
CATCGAAGCGTTATGGCCTGGGAGCCCACCGGGGGAGACTCAACATCCAGGCAGGCATTAATGAAAATGACTTTTAC 
GATGGGGCTTGGTGTGCTGGTAGGAACGACTTGCATCAGTGGATCGAAGTGGATGCCCGGCGCCTGACCAAGTTCAC 
AGGGGTCATTACCCAAGGAAGGAACTCTCTCTGGCTGAGTGACTGGGTGACATCCTATAAAGTCATGGTGAGCAATG 
AC AGC CACACATGGG T T AC TGTG AAG AAT GG AT C TGG C G AC AT G AT ATTTG AAG G AAAC AGTG AG AAG G AG ATTC C T 
GTGCTCAATGAGCTGCCAGTCCCCATGGTGGCCCGCTACATTCGCATAAACCCTCAGTCCTGGTTTGATAACGGGAG 
CATCTGCATGAGGATGGAGATCTTGGGCTGCCCACTGCCGGATCCTAATAACTATTATCACCGACGTAATGAGATGA 
CCACCACGGATGACCTGGATTTTAiVGCACCACAACTATAAGGAAATGCGCCAGTTGATGAAGGTTGTCAATGAAATG 
TGCCCCAATATTACCAGGATTTACAACATTGGCAAAAGCCACCAGGGCCTGAAATTGTATGCGGTAGAGATCTCTGA 
CCATCCTGGGGAACATGAAGTTGGTGAGCCCGAGTTCCACTACATCGCAGGGGCCCACGGCAATGAGGTTCTGGGAC 
GAGAACTGCTGCTGCTGCTGCTGCACTTCCTCTGCCAGGAATACTCGGCGCAGAACGCACGCATCGTCCGCTTGGTG 
GAGGAGACTCGAATCCACATTCTACCCTCCCTCAATCCTGATGGCTATGAGAAGGCCTATGAAGGAGGTTCCGAGTT 
GGGAGGCTGGTCCCTGGGACGTTGGACCCATGATGGCATCGATATCAACAACAACTTTCCGGATTTAAACTCGCTGC 
TCTGGGAGGCAGAGGACCAGCAGAATGCCCCAAGGAAGGTCCCCAACCACTACATTGCCATCCCTGAGTGGTTTCTG 
TCTGAGAATGCCACAGTGGCCACAGAGACCAGAGCCGTCATCGCCTGGATGGAGAAGATCCCGTTTGTGCTGGGAGG 
CAACCTACAGGGGGGTGAGCTGGTCGTGGCATACCCCTATGACATGGTGCGGTCCCTGTGGAAGACCCAGGAGCACA 
CCCCAACACCTGATGATCATGTGTTCCGCTGGCTGGCGTATTCCTACGCCTCCACTCACCGCCTCATGACAGATGCC 
AGGAGGCGAGTGTGCCACACGGAAGATTTTCAGAAGGAGGAGGGCACCGTCAATGGGGCTTCCTGGCACACAGTGGC 
TGGAAGTCTAAACGATTTCAGCTACCTCCATACAAACTGCTTTGAGCTGTCCATCTACGTGGGCTGTGATAAATACC 
CACACGAGAGCGAGCTGCCGGAGGAATGGGAGAATAACCGGGAGTCTCTGATTGTGTTCATGGAGCAGGTTCATCGA 
GGCATCAAAGGCATAGTGAGAGATTTACAAGGGAAAGGGATTTCAAATGCTGTCATCTCTGTGGAAGGTGTTAACCA 
TGACATCCGGACAGCCAGCGATGGGGATTACTGGCGTCTACTGAACCCTGGCGAATATGTGGTCACAGCCAAGGCGG 
AAGGCTTTATCACTTCCACCAAGAACTGCATGGTTGGCTATGATATGGGAGCTACTCGGTGTGACTTCACCCTCACA 
AAGACCAACCTGGCTAGGATAAGAGAAATTATGGAGACATTTGGGAAGCAGCCTGTCAGCCTACCCTCCAGGCGCCT 
GAAGCTGCGGGGACGGAAAAGGCGGCAGCGTGGGTGACCCTGTCGGACACTTGAGACATACCCCAGACCGTGCAAAT 
AAAAATCC AC TC C AGT AGTAAAAAAA 
(SEQ ID NO: 1) 



MARLGTACPALALALALVAVALAGVRAQGAAFEEPDYYSQELWRRGRYYGHPEPEPEPELFSPSMHEDLR 
VEEQEQQEPHQQGHRTPKKAIKPKKAPKREKLVAETPPPGKNSNRKGRRSKNLEKAASDDHGVPVAHEDV 
RESCPPLGLETLKITDFQLHASTSKRYGLGAHRGRLNIQAGINENDFYDGAWCAGRNDLHQWIEVDARRL 
TKFTGVITQGRNSLWLSDWTSYKVWSNDSHTW^ 

IN PQ S WFDNGS I CMRME I LGC P L PD PNNYYHRRNEMTTTDDLDFKHHNYKEMRQLMK WNEMC PN I TRI Y 
NIGKSHQGLKLYAVEISDHPGEHEVGEPEFHYIAGAHGNEVLGRELLLLLLHFLCQEYSAQNARIVRLVE 
ETRIHILPSLNPDGYEKAYEGGSELGGWSLGRWTHDGIDINNNFPDLNSLLWEAEDQQNAPRKVPNHYIA 
IPEWFLSENATVATETRAVIAV^EKIPFVLGGNLQGGELWAYPYDMVRSLWKTQEHTPTPDDHVFRWLA 
YSYASTHRLMTDARRRVCHTEDFQKEEGTVWGASWHTVAGSLNDFSYLHTNCFELSIYVGCDKYPHESEL 
PEEWENNRESLIVFMEQVHRGIKGIVRDLQGKGISNAVISVEGVNHDIRTASDGDYWRLLNPGEYWTAK 
AEGFITSTKNCMVGYDMGATRCDFTLTKTNLARIREIMETFGKQPVSLPSRRLKLRGRKRRQRG (SEQ 
ID NO: 2) 



FIGURE 1 



Gl'9*s!iOOl'5, 



0,:i: : SOi 



2/4 



underlined = deleted in targeting construct 

[ ] = sequence flanking Neo insert in targeting construct 



[AGGCTGTCCCACCCACCATCTGCACCCGCTGCAGCGCCCGCGCCCCTGTCCCGCGCCGT 
AGTCGTCATTTGTAGCCCGCCTGCCGCTCCCGGGGACCCGATCCTACCCTGGGTGCGGGG 
CAGAGCGGGCATGGCCCGTCTGGGGACCGCCTGCCCTGCGCTGGCGCTGGCCCTGGCACT 
TGTGGCGGTGGCCCTGGCTGGAGTCAGAGCCCAGGGCGCAGCCTTCGAGGAGCCTGACTA 
TTACAGCCAGGAGCTCTGGCGGCGCGGGCGCTATTATGGGCATCCGGAGCCTGAGCCGGA 
GCCGGAGCTCTTCTCGCCTTCAATGCA] TGAAGACCTTAGGGTGGAGGAGCAGGAACAGC 
AGGAGCCGCACCAGCAGGGCCACAGAACTCCCAAGAAGGCCATCAAGCCCAAGAAGGCTC 
CCAA [GAGGGAGAAGTTAGTTGCAGAGACGCCTCCACCAG] GTAAAAATAGCAACAGAAA 
AGGCAGAAGAAGCAAGAATCTTGAGAAAGCTGCCAGTGATGACCATGGTGTCCCTGTGGC 
TCATGAGGATGTCAGAGAGAGTTGCCCACCTCTTGGTCTGGAAACATTAAAAATCACAGA 
CTTCCAGCTGCATGCCTCCACATCGAAGCGTTATGGCCTGGGAGCCCACCGGGGGAGACT 
CAACATCCAGGCAGGCATTAATGAAAATGACTTTTACGATGGGGCTTGGTGTGCTGGTAG 
GAACGACTTGCATCAGTGGATCGAAGTGGATGCCCGGCGCCTGACCAAGTTCACAGGGGT 
CATTACCCAAGGAAGGAACTCTCTCTGGCTGAGTGACTGGGTGACATCCTATAAAGTCAT 
GGTGAGCAATGACAGCCACACATGGGTTACTGTGAAGAATGGATCTGGCGACATGATATT 
TGAAGGAAACAGTGAGAAGGAGATTCCTGTGCTCAATGAGCTGCCAGTCCCCATGGTGGC 
CCGCTACATTCGCATAAACCCTCAGTCCTGGTTTGATAACGGGAGCATCTGCATGAGGAT 
GGAGATCTTGGGCTGCCCACTGCCGGATCCTAATAACTATTATCACCGACGTAATGAGAT 
GACCACCACGGATGACCTGGATTTTAAGCACCACAACTATAAGGAAATGCGCCAGTTGAT 
GAAGGTTGTCAATGAAATGTGCCCCAATATTACCAGGATTTACAACATTGGCAAAAGCCA 
CCAGGGCCTGAAATTGTATGCGGTAGAGATCTCTGACCATCCTGGGGAACATGAAGTTGG 
TGAGCCCGAGTTCCACTACATCGCAGGGGCCCACGGCAATGAGGTTCTGGGACGAGAACT 
GCTGCTGCTGCTGCTGCACTTCCTCTGCCAGGAATACTCGGCGCAGAACGCACGCATCGT 
CCGCTTGGTGGAGGAGACTCGAATCCACATTCTACCCTCCCTCAATCCTGATGGCTATGA 
GAAGGCCTATGAAGGAGGTTCCGAGTTGGGAGGCTGGTCCCTGGGACGTTGGACCCATGA 
TGGCATCGATATCAACAACAACTTTCCGGATTTAAACTCGCTGCTCTGGGAGGCAGAGGA 
CCAGCAGAATGCCCCAAGGAAGGTCCCCAACCACTACATTGCCATCCCTGAGTGGTTTCT 
GTCTGAGAATGCCACAGTGGCCACAGAGACCAGAGCCGTCATCGCCTGGATGGAGAAGAT 
CCCGTTTGTGCTGGGAGGCAACCTACAGGGGGGTGAGCTGGTCGTGGCATACCCCTATGA 
CATGGTGCGGTCCCTGTGGAAGACCCAGGAGCACACCCCAACACCTGATGATCATGTGTT 
CCGCTGGCTGGCGTATTCCTACGCCTCCACTCACCGCCTCATGACAGATGCCAGGAGGCG 
AGTGTGCCACACGGAAGATTTTCAGAAGGAGGAGGGCACCGTCAATGGGGCTTCCTGGCA 
CACAGTGGCTGGAAGTCTAAACGATTTCAGCTACCTCCATACAAACTGCTTTGAGCTGTC 
CATCTACGTGGGCTGTGATAAATACCCACACGAGAGCGAGCTGCCGGAGGAATGGGAGAA 
TAACCGGGAGTCTCTGATTGTGTTCATGGAGCAGGTTCATCGAGGCATCAAAGGCATAGT 
GAGAGATTTACAAGGGAAAGGGATTTCAAATGCTGTCATCTCTGTGGAAGGTGTTAACCA 
TGACATCCGGACAGCCAGCGATGGGGATTACTGGCGTCTACTGAACCCTGGCGAATATGT 
GGTCACAGCCAAGGCGGAAGGCTTTATCACTTCCACCAAGAACTGCATGGTTGGCTATGA 
TATGGGAGCTACTCGGTGTGACTTCACCCTCACAAAGACCAACCTGGCTAGGATAAGAGA 
AATTATGGAGACATTTGGGAAGCAGCCTGTCAGCCTACCCTCCAGGCGCCTGAAGCTGCG 
GGGACGGAAAAGGCGGCAGCGTGGGTGACCCTGTCGGACACTTGAGACATACCCCAGACC 
GTGCAAATAAAAATCCACTCCAGTAGTAAAAAAA ( SEQ ID NO : 1 ) 
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Gene Sequence Structure 



327 bp 



Sequence Deleted 



422 bp 



Size of full-length 
cDNA: 2490 bp 




Targeting Vector* 
(genomic sequence) 

Construct Number: 2825 



Arm Length: 
5': 4 kb 
3': 1.5 kb 



5' arm 



LacZ-Neo 
Cassette 



V arm 



3' probe 




Tar&eting Vector 
Endogenous Locus 



* Not drawn to scale 



5 1 >GGCATGGCCCGTCTGGGGACC 
GCCTGCCCTGCGCTGGCGCTGGCC 
CTGGCACTTGTGGCGGTGGCCCTG 
GCTGGAGTCAGAGCCCAGGGCGCA 
GC CTTCGAGGAGC C TG AC T ATT AC 
AGCCAGGAGCTCTGGCGGCGCGGG 
CGCT ATT ATGGGC ATC C GGAGCC T 
GAGCCGGAGCCGGAGCTCTTCTCG 
CCTTCAATGCA<3 ' 
( SEQ ID NO: 3) 



5 ' > GAGGGAGAAGTT AGTTGC AGA 
G AC GC C TC C AC C AGGT AAC TTTTG 
CATCGGGCAGCCCGAGGGGGCGCC 
AGCGATCGTGGCACTCCAGGGGAC 
ACCTGGCTTCCAGTATGTTTTCTT 
GAGTGAGCCCAGCCAAAGTCCTGT 
GGTGCCTGTGTTATTCCCTAGAGA 
C T AC ATC TG AGC T AAGTTC AGC TT 
TCTCTCCCTGC<3 ■ 
(SEQ ID NO: 4) 
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